SUPPORT / SAMPLES & SAS NOTES
 

Support

Problem Note 54358: Incorrect score chi-square values with SELECTION=SCORE and extremely ill conditioned data

DetailsAboutRate It

When using the best subset selection method (SELECTION=SCORE option) in PROC LOGISTIC, if the data in the candidate predictors are extremely ill-conditioned then the score chi-square statistics reported in the "Regression Models Selected by Score Criterion" table might be incorrect. The data are ill-conditioned if there are constant variables, if the variables have a very wide variation in scales, or if there are highly-collinear/redundant variables. The problem can be avoided by removing variables causing the ill-conditioning, and then using the SELECTION=SCORE method on the remaining variables. To do this, remove any constant variables, use PROC STANDARD to standardize the variables if there is a wide difference in scales, and remove any variables that are redundant as illustrated below.

This PROC STANDARD step standardizes all numeric candidate variables in data set MYDATA.

      proc standard data=mydata out=stddata m=0 s=1;
        var <all-candidate-variables>;
        run;

Run PROC LOGISTIC on the standardized data using all of the desired candidate variables. Include an ODS OUTPUT statement to save the table of parameter estimates.

      proc logistic data=stddata;
        model <response-variable> = <all-candidate-variables>;
        ods output parameterestimates=pe;
        run;

The following DATA step creates a macro variable (UseTerms) that contains the list of nonconstant and nonredundant candidate predictors.

      %let UseTerms=;
      data _null_; 
        set pe;
        if df=1 and Variable ne "Intercept" then
        call symput("UseTerms",catx(" ",symget("UseTerms"),Variable));
        run;

Using only the standardized variables in the UseTerms macro variable avoids any ill-conditioning in the PROC LOGISTIC step that implements the SELECTION=SCORE model selection method.

      proc logistic data=stddata;
        model <response-variable> = &UseTerms / selection=score;
        run;


Operating System and Release Information

Product FamilyProductSystemSAS Release
ReportedFixed*
SAS SystemSAS/STATWindows 7 Professional x649.2 TS2M39.4 TS1M3
Windows 7 Professional 32 bit9.2 TS2M39.4 TS1M3
Windows 7 Home Premium x649.2 TS2M39.4 TS1M3
Windows 7 Home Premium 32 bit9.2 TS2M39.4 TS1M3
Windows 7 Enterprise x649.2 TS2M39.4 TS1M3
Windows 7 Enterprise 32 bit9.2 TS2M39.4 TS1M3
Microsoft Windows XP Professional9.2 TS2M3
Microsoft Windows Server 2008 for x649.2 TS2M39.4 TS1M3
Microsoft Windows Server 2008 R29.2 TS2M39.4 TS1M3
Microsoft Windows Server 20089.2 TS2M39.4 TS1M3
Microsoft Windows Server 2003 for x649.2 TS2M3
Microsoft Windows Server 2003 Standard Edition9.2 TS2M3
Microsoft Windows Server 2003 Enterprise Edition9.2 TS2M3
Microsoft Windows Server 2003 Datacenter Edition9.2 TS2M3
z/OS9.2 TS2M39.4 TS1M3
Microsoft® Windows® for 64-Bit Itanium-based Systems9.2 TS2M3
Microsoft Windows Server 2003 Datacenter 64-bit Edition9.2 TS2M3
Microsoft Windows Server 2003 Enterprise 64-bit Edition9.2 TS2M3
Microsoft Windows XP 64-bit Edition9.2 TS2M3
Microsoft® Windows® for x649.2 TS2M39.4 TS1M3
Windows 7 Ultimate 32 bit9.2 TS2M39.4 TS1M3
Windows 7 Ultimate x649.2 TS2M39.4 TS1M3
Windows Vista9.2 TS2M3
Windows Vista for x649.2 TS2M3
64-bit Enabled AIX9.2 TS2M39.4 TS1M3
64-bit Enabled HP-UX9.2 TS2M39.4 TS1M3
64-bit Enabled Solaris9.2 TS2M39.4 TS1M3
HP-UX IPF9.2 TS2M39.4 TS1M3
Linux9.2 TS2M39.4 TS1M3
Linux for x649.2 TS2M39.4 TS1M3
OpenVMS on HP Integrity9.2 TS2M39.4 TS1M3
Solaris for x649.2 TS2M39.4 TS1M3
Microsoft Windows 2000 Advanced Server9.1 TS1M0
Microsoft Windows 2000 Datacenter Server9.1 TS1M0
Microsoft Windows 2000 Server9.1 TS1M0
Microsoft Windows 2000 Professional9.1 TS1M0
Microsoft Windows NT Workstation9.1 TS1M0
OpenVMS Alpha9.1 TS1M09.4 TS1M3
Tru64 UNIX9.1 TS1M09.4 TS1M3
Z649.4 TS1M29.4 TS1M3
Microsoft Windows 8 Enterprise 32-bit9.4 TS1M29.4 TS1M3
Microsoft Windows 8 Enterprise x649.4 TS1M29.4 TS1M3
Microsoft Windows 8 Pro 32-bit9.4 TS1M29.4 TS1M3
Microsoft Windows 8 Pro x649.4 TS1M29.4 TS1M3
Microsoft Windows 8.1 Enterprise 32-bit9.4 TS1M29.4 TS1M3
Microsoft Windows 8.1 Enterprise x649.4 TS1M29.4 TS1M3
Microsoft Windows 8.1 Pro9.4 TS1M29.4 TS1M3
Microsoft Windows 8.1 Pro 32-bit9.4 TS1M29.4 TS1M3
Microsoft Windows Server 2012 Datacenter9.4 TS1M29.4 TS1M3
Microsoft Windows Server 2012 R2 Datacenter9.4 TS1M29.4 TS1M3
Microsoft Windows Server 2012 R2 Std9.4 TS1M29.4 TS1M3
Microsoft Windows Server 2012 Std9.4 TS1M29.4 TS1M3
* For software releases that are not yet generally available, the Fixed Release is the software release in which the problem is planned to be fixed.